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rr 1 ■ Abstract 

o 

' We study the problem of metrical service systems with multiple servers (MSSMS), which 

generalizes two well-known problems - the /c-server problem, and metrical service systems. 
The MSSMS problem is to service requests, each of which is an Z-point subset of a metric 
space, using k servers, with the objective of minimizing the total distance traveled by the 
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' servers. 

Feuerstein |14] initiated a study of this problem by proving upper and lower bounds on the 
deterministic competitive ratio for uniform metric spaces. We improve Feuerstein's analysis 
^"*^ ' of the upper bound and prove that his algorithm achieves a competitive ratio of k (C 1 ^ 1 ) — lj ■ 

In the randomized online setting, for uniform metric spaces, we give an algorithm which 
achieves a competitive ratio 0(k 3 logZ), beating the deterministic lower bound of — 1. 
We prove that any randomized algorithm for MSSMS on uniform metric spaces must be 
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fi(log fcZ)-competitive. We then prove an improved lower bound of ( +2 k 1 ) — ( I 1 ) on the 
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. competitive ratio of any deterministic algorithm for (k, Z)-MSSMS, on general metric spaces. 

In the offline setting, we give a pseudo-approximation algorithm for (k, Z)-MSSMS on general 
metric spaces, which achieves an approximation ratio of I using kl servers. We also prove a 
matching hardness result, that a pseudo- approximation with less than kl servers is unlikely, 



even for uniform metric spaces. For general metric spaces, we highlight the limitations of a 
few popular techniques, that have been used in algorithm design for the fc-server problem 
and metrical service systems. 

> : 

j_l ! 1 Introduction 

.9. 

The problem of metrical service systems with multiple servers (MSSMS) is a generalization of 
two well-known problems - the /c-server problem [21] and metrical service systems (MSS) [§ll24|. 
These problems share a common paradigm, that there is an underlying metric space and requests 
are to be served by moving servers on the metric space, in such a way that the total distance 
traveled by the servers is minimized. For a problem in the online setting, the input is revealed 
to an algorithm piece by piece. An algorithm is constrained to make irrevocable decisions on 
seeing each piece. In case of the aforementioned problems, each piece in the input is a request, 
and the irrevocable decisions are the movements of the servers. A (possibly randomized) online 
algorithm is said to be c-competitive if, on every input, it returns a solution whose (expected) 
cost is at most c times the cost of the optimum solution for that input. The book by Borodin 
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and El-Yaniv [5j gives a nice comprehensive introduction to online algorithms and competitive 
analysis. 

The /c-server problem 

The /c-server problem of Manasse et. al. [M] is, arguably, the most famous among the 
problems that are naturally posed in the online setting. The following quote by Koutsoupias, in 
his beautiful survey on the /c-server problem |21j . upholds the importance of this problem. 

The k- server problem is perhaps the most influential online problem: natural, crisp, 
with a surprising technical depth that manifests the richness of competitive analysis. 
The k-server conjecture, which was posed more than two decades ago when the prob- 
lem was first studied within the competitive analysis framework, is still open and has 
been a major driving force for the development of the area online algorithms. 

In the /c-server problem, we have k servers occupying points in a metric space. Each request 
is a point in the metric space. To serve the request, one of the servers has to be moved to 
the requested point. The /c-server conjecture referred to in the quote states that there is a 
/c-competitive algorithm for the /c-server problem. 

Manasse et. al. |24] proved a lower bound of k on the competitive ratio of any deterministic 
algorithm on any metric space with more than k points. They proved that the competitive 
ratio is k only for very specific cases, and posed the /c-server conjecture. The conjecture has 
been shown to hold for a few metric spaces, for example, the line [8] and tree metric spaces 
|10| . Fiat, Rabani and Ravid |16| were the first to give an algorithm for the /c-server problem, 
with competitive ratio bounded by a function of k, which was later improved by Grove \17\ H]. 
The breakthrough result was due to Koutsoupias and Papadimitriou, who proved that an al- 
gorithm, first proposed by Chrobak and Larmore, and called the Work Function Algorithm, is 
(2/c — l)-competitive [22 i. In case of randomized algorithms, the best known lower bound for 
every metric space is fJ(log kj log log k) due to Bartal et. al. [3], and there do exist metric spaces 
with a lower bound more than Hj, [18]. No better algorithm than the Work Function Algorithm 
is known, even with randomization. The randomized /c-server conjecture states that there exists 
a randomized algorithm for the /c-server problem with competitive ratio C(log k) on any metric 
space. Recent developments, which ingeneously adapt the primal-dual framework to the online 
setting, have been applied to the /c-server problem, culminating in a 0(log /c)-competitive ran- 
domized algorithm for star metrics [2], and a 0(poly log(fc) poly log(n))-competitive randomized 
algorithm on metric spaces of size n pQ . 
Metrical Service System 

The term Metrical Service System (MSS) was coined by Chrobak and Larmore [TT] for the 
following problem. We have a single server in an underlying metric space. Each request is a set 
of I points from the metric space, where I is called the width, and is a parameter to the problem. 
To serve a request, the server has to be dispatched to one of the requested points. 

Finding shortest paths is a fundamental problem in the offline setting. In the online setting 
it is posed as the problem of Layered Graph Traversal (LGT). This problem, introduced by 
Papadimitriou and Yannakakis [25], was a precursor of MSS. Fiat et. al. [15J proved that MSS 
and LGT are in fact equivalent problems. That is, there is a c-competitive algorithm for MSS 
if and only if there is a c-competitive algorithm for LGT. They also proved that there exists a 
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metric space on which the competitive ratio of any deterministic algorithm for the MSS problem 
is Q(2 l ). Further, they gave an C(9')-competitive algorithm. They proved that 1/2 is a lower 
bound on the competitive ratio of any randomized algorithm for LGT. Ramesh [26J gave a better 
deterministic algorithm for LGT, which achieves a competitive ratio of l 3 2 l , and a randomized 
Z 13 -competitive algorithm. He proved that there exists a metric space on which any randomized 
algorithm must have competitive ratio f£(Z 2 / log 1+e /), for any e > 0. Burley [7] proved that a 
variant of the Work Function Algorithm (WFA) is 0(1 ■ 2')-competitive for the MSS (and hence, 
the LGT) problem. For the uniform metric space, Chrobak and Larmore proved a lower 
bound of / on the competitive ratio of any deterministic algorithm, and also gave an algorithm 
achieving this bound. It is easily seen that the lower bound holds for any metric space with at 
least I + 1 points. 

Metrical Service System with Multiple Servers 

A natural generalization of both the A:-server problem and metrical service system, has k 
servers on an underlying metric space, and each request is a set of I points from the metric 
space. To serve a request, one of the k servers has to move to one of the I requested points. 
We call this problem Metrical Service System with Multiple Servers, with parameters k and 
I ((fc, Z)-MSSMS). Feuerstein [TJ] studied this problem for uniform metric spaces and called it 
the Uniform Service System with parameters k and I (USS(fc, I)). He proved a lower bound of 
( fc fc — 1 on t ne competitive ratio of any deterministic algorithm for this problem. In fact, this 
proof holds for any arbitrary metric space with at least k + I points. Feuerstein also gave an 
algorithm and proved that its competitive ratio is k ■ min ( \Z% ; Eto I* + l k ) ■ He concluded 
the paper with the following comment. 

An interesting subject of future research is to extend USS(k, lj^ to non-uniform metric 
spaces. This would extend both the work in this paper and the work by Chrobak and 
Larmore fSj on Metrical Service Systems, where only one server is considered. 

Our results: In Section [2] we present a simple analysis of Feuerstein's algorithm, which im- 
proves the bound on its competitive ratio proved in [T5j, to k ■ f ( fe ^) — 1^ • In Section [3] we give 
a 0(k 3 log Z)-competitive randomized algorithm on uniform metric spaces, which beats the de- 
terministic lower bound by an exponential factor. We also give an 0(log kl) lower bound on the 
competitive ratio of any randomized algorithm. We then give a lower bound of ( fc+2 fc z " 1 ) - , 
on the competitive ratio of any deterministic algorithm for (k, Z)-MSSMS, in Section SI which 
improves the bound of ( fc +') - 1 in [U] for all k, I > 2. In Section we consider the offline 
(k, Z)-MSSMS problem, where we give a pseudo-approximation by a factor of I using kl servers. 
We prove a matching lower bound, assuming the Unique Games Conjecture. We discuss the lim- 
itations of some well-studied techniques, towards obtaining online algorithms for (k, Z)-MSSMS, 
in Section [6l We conclude with a number of interesting open problems in Section \7\ 



1 Feuerstein used 'to' for the width parameter, while we use T 
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2 Uniform metric spaces: The Hitting Set Algorithm 



In this section we analyze an algorithm for (k, /)-MSSMS on uniform metric spaces given by 
Feuerstein [14j . which he calls the Hitting Set algorithm. Feuerstein proved that the competitive 
ratio of this algorithm is at most k ■ min 

(fer>£?=o r + l k ), whereas we prove a bound of 

k • ^ ( k ~l l ) — l^j , which is better asymptotically. 

The Hitting Set (HS) algorithm can be described as follows. HS divides the request sequence 
into phases, the first phase starting with the first request of the sequence. We say that a request 
produces a fault whenever the requested set of points is disjoint from the set of points occupied 
by the servers. Each time a request produces a fault, the algorithm behaves as follows. First, it 
computes a minimum cardinality set H of points that intersects all the requests that produced 
a fault during the current phase. If \H\ < k then any \H\ servers are made to occupy all points 
in H. Otherwise, if \H\ > k, then the phase terminates and a new phase begins with the current 
request. 

Theorem 1. HS is an k • (^( k ^ 1 ) — l^j -competitive algorithm for (k,l)-MSSMS. 

Proof. Feuerstein observed that the adversary must incur a cost of at least one per phase. He 
then proved that at most min {JjtEy, Y^l=o ^ + lj requests can produce faults in any phase. We 
improve this upper bound to ( k j l ) - 1, and our claim follows, since the algorithm pays at most 
k for every request that produces a fault. 

Let Ai, . . . , A r _\ be the requests that produced a fault in a given phase, and let A r be the 
first request in the next phase, which also must produce a fault. Let Bi be the set of points 
occupied by the servers when the request Ai was given. Since Ai produced a fault, we have for 
every i with 1 < i < r 

Ai n Bi = (1) 

On the other hand, since no request between j4«_i and Ai produced a fault, the hitting set 
chosen to serve ^4j_i must be Bi itself. By the definition of the algorithm, for any i, j with 
1 < j < i < r we have 

Aj DBi^f/i (2) 

The skew Bollobds theorem due to Lovasz [23] asserts that for any set system satisfying ([1]) and 
([2]) with \Ai\ = k and \Bi\ = I, the number of pairs (Ai, Bi) is at most ( k ^ 1 )- Thus, the number 
of requests in the phase, that produced a fault, is r — 1 < ( k t l ) — 1, as required. □ 

3 Uniform metric spaces: Randomized bounds 
3.1 The Randomized Hitting Set Algorithm 

In this section we give a randomized version of the Hitting Set algorithm from Section [21 which 
we call the Randomized Hitting Set (RHS) algorithm, and prove that its competitive ratio is 
C(fc 3 log/). The algorithm is as follows. 

RHS divides the sequence of requests in phases, just like the Hitting Set algorithm. Each time 
a request produces a fault, the algorithm behaves as follows. First, it computes the minimum 
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cardinality, s, of a set of points that intersects all the requests in the current phase given so far. 
If s < k, it chooses a set H uniformly at random from the collection of all the hitting sets of size 
s, and then any s servers are made to occupy all points in H. Otherwise, if s > k the current 
phase ends and a new phase begins with the current request. 

As before, the adversary must incur a cost of at least 1 per phase. We prove that the 
expected cost of RHS is 0(A; 3 log/) per phase. Note that the value of s increases from 1 to k 
as the algorithm progresses in a phase. We divide the phase into k sub-phases, where the i th 
sub-phase is the part of the phase when the value of s is equal to i. We will require the following 
combinatorial lemma. 

Lemma 2. Let s be the size of the smallest hitting set of an l-uniform set system S. Then the 
number of minimum hitting sets of S is at most I s . 

Proof. We prove the lemma by induction on s. For s = 1 the claim is obvious. For a general s, 
let H be the collection of all minimum hitting sets of S. Let A' be an arbitrary set in S. For 
each e E A' , let 

S e = {AeS | e<£A} 
U e = {B\{e} | B eH, e E B} 

Then each set in H e hits each set in S e , so the size of smallest hitting set of S e is at most s — 1. 
Further, if S e has a hitting set H of size less than s—1, then HL){e} would be a hitting set of S of 
size less than s. This is a contradiction. Thus, the size of the smallest hitting set of S e is exactly 
s — 1. Therefore by induction, we have \H e \ < I s-1 , for all e E A'. Now each set in H intersects A' 
in some e and therefore contributes to H e for some e. Thus \H\ < J2eeA> l^e| < I ■ I s 1 = I s - D 

Theorem 3. RHS is an O(k^logl) competitive algorithm for (k,l)-MSSMS. 

Proof. Consider the s'th sub-phase of any phase. Let A\, . . . ,A r be the sets requested in this 
sub-phase and let Bi be the set of locations occupied by the servers, just before the set A4 
was requested. Let Sq be the collection of sets requested in the current phase before the s'th 
sub-phase started (that is, before A± was requested), and let Si = Sq U {A\, . . . , A4}. Let Hi be 
the collection of all hitting sets of Si, having size s. Thus, each Tii is an s-uniform set system, 
and we have 

Sq C S\ C • ■ ■ C S r 
Hq 5 Hi 5 • • • 5 U r 

Further, if Bi € Hi then Bi + \ = Bi, else Bj+i is picked uniformly at random from Hi, due to the 
construction of the algorithm. For each Bj there exists a unique tj such that Bj G %t -l 
Then either tj > j and -Bj+i = Bj, or tj = j and Bj+\ is picked uniformly at random from Hj. 
The number of requests for which the algorithm shifts servers, is thus one less than the size of 
the set T = {tj \ 1 < j < r}. We next bound the expected size of T. 

Let hi = \Hi\, so that hi > h% > ■ ■ ■ > h r . We know B\ E Ho \ Hi, since the current phase 
started with A\ which must be disjoint from B\. Thus 1 E T with probability 1. 

2 Assume H r +i = 0- 
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We claim that for i > 1, Pr[i G T] = (/ij-i — hi)/hi-\. For j < i we say that the event 
.Eji has occurred if i,j G T but no i' with j < i' < i is in T. This event occurs exactly when 
-Bj ^ "Hj, that is, Bj n = 0, and the next set Bj+i, chosen uniformly at random from T~Lj, 
turns out to be in Hi-i \ %%■ Thus, we have 

Pr^ | j G T] 



hi- 



i-1 



(3) 



Note that for a fixed i, the events Eji are pairwise disjoint and i G T iff Eji occurs for some 
j < i. Thus we have Pr[i G T] = Ej=i Pr [£ji] = Ei=i Pr [% I J G ^]Pr[j G T]. We proceed 
by induction on i. As the base case, we have for i = 2, Pr[2 G T] = Pr[£?i2] = {h\ — h-zj/hx, as 
required. For the inductive step, by induction hypothesis and equation ([3]) we have 



Pr[i G T] 



hi- 



i-l 



hi 



i-l 
3=2 



hi— i — /ij hj—i — /ij 



i-l 



/i ; 



i-l 



3=2 



^3-1 
1 



(/l 



i-l 



H-l 



hi 



hi 



H-l 



The expected size of T is thus given by 



1 ^ 1 I , 



3=2 



4-1 



E[|T|] = ^P r [i GT ] = l + ^ 



H-l 



i=l 



< 



i=2 



+ 



i=2 
1 



i-l 



ft 



H-l 



+ ••• + 



1 



i-l 



^ + 1 



hi 

3=1 J 



Now, /ii = I'Hi) and %i is the collection of all minimum hitting sets of the Z-uniform set 
system Si, with minimum hitting set size s. Thus, by Lemma [21 hi < I s and hence E[|T|] as 
well as the expected number of faults in the s'th sub-phase, is O(slogZ). The cost incurred, for 
every request which produces a fault, is at most s, and hence the total cost is 0(s 2 log I), in the 
s'th sub-phase. Summing over s from 1 to k, we infer that the cost incurred in an entire phase 
is 0(k 3 logl). □ 

3.2 Lower Bound for Randomized Online Algorithms 

In this section we will prove a lower bound on the competitive ratio of randomized algorithms, 
against an oblivious adversary^ for (k, /)-MSSMS on uniform metric spaces. We will use a form 
of Yao's principle [28]. The required form of this principle is as follows [61 127j. 

Theorem 4 (Yao's principle). Let 21 denote the set of deterministic online algorithms for an 
online minimization problem. If P is a distribution over input sequences such that 

inf E [A(p)\ > c • E [OPTO)] 

for some real number c > 1, then c is a lower bound on the competitive ratio of any randomized 
algorithm against an oblivious adversary. 



5 An oblivious adversary is an adversary who does not have access to the random bits used by the algorithm. 
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Theorem 5. The competitive ratio of any randomized online algorithm for (k,l)-MSSMS is 
ft (log kl). 

Proof. We use Theorem [U the required distribution P is given by the following random process. 
Let Ai be the uniform metric space with k + I points, and let -Bo be the set of points occupied 
by the servers initially. Let S be the collection of all sets of points B such that \B\ = k and 
\B\Bq\ = 1. We generate random subsets of size I and request them, until the complements 
of all sets in S have been picked. We do not request the last such set picked, say A. Note that 
A E S, and A does not appear in the input. Let p denote this random input. 

The adversary can serve the request sequence p, simply by shifting the servers to the points 
in A. Since the metric space has only k + I points, A intersects with all the sets that appear in 
p. Further, since AeS, \ A\B Q \ = 1. Thus OPT(p) < 1. 

The algorithm incurs a cost exactly when the request is the complement of the set of positions 
occupied by its servers. This happens with probability l/( fc ^), and hence the expected cost 
incurred by the algorithm for any request is . Let X{ be the random variable representing 

the cost on the i th request, and Y be the random variable representing the number of requests 
in p. Then E p ~p [X t \Y > i] = l/{ k f) and E p ~p [Xt\Y < i] = 0. Thus, E p ~p [X t ] = l/{ k f) ■ 
Pr p ~p [Y > i], and therefore, the expected total cost is given by 

By the coupon collector argument, the expected number of requests in p is ( fc ^) ■ — 1 = 

( k f l )-H(kl)-l. Thus the expected cost incurred by the algorithm is (( k + l ) ■ H{kl) - l) / i^f) = 
Q(logH). ' ' □ 



4 General metric spaces: Lower bound 

In this section we prove a lower bound on the competitive ratio of any algorithm for (k, l)- 
MSSMS. We give a metric space with two distances, on which an adversary can force any 
algorithm to perform poorly. Our lower bound is strictly better than Feuerstein's lower bound 
P3] of ( fe + z ) - 1, for all jfe > 1, I > 1. The basic idea of this proof, is that the adversary runs 
many algorithms in parallel with the online algorithm, in such a way that the total cost incurred 
by these algorithms, is at most the cost incurred by the online algorithm. This idea is due to 
Manasse et. al. [23], however, our analysis is much more involved. 

Theorem 6. There is a metric space on which the competitive ratio of any deterministic online 
algorithm for the (k,l)-MSSMS problem is more than — {^l ^) • 

Proof. The required metric space is given by the set of points M = [I] x [k + 1] and the distance 
function d given by 

d((x,y),(x',y')) = if (x,y) = (x',y') 

= 1 if x = x and y ^ y' 
= D if x x 
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where D > 1 will be determined later. It is easy to check that d satisfies the triangle inequality. 
We will call the set of points {i} x [k + 1] the i'th cluster Mj. Thus, a pair of distinct points 
is separated by a distance 1 if they belong to the same cluster, and a distance D otherwise. 
Initially the servers occupy the points (1, 1), . . . , (1, k). 

Fix an online algorithm. The adversary forces the algorithm to perform poorly, as follows. 
At any time, the adversary picks, from each of the I clusters, the lowest numbered point which 
is not occupied by the algorithm's servers. It gives this set of points as the next request. Note 
that this is possible because each cluster has k + 1 points but the algorithm has only k servers. 
Since none of the points in this request set are occupied by the algorithm's servers, the algorithm 
incurs a nonzero cost to serve each request. We may assume without loss of generality that the 
algorithm is lazy, that is, it moves its servers only to serve requests, and thus, moves exactly one 
server for every requesto For the purpose of analysis, we divide the input into phases, based on 
the behaviour of the algorithm to serve the input. A phase ends whenever the algorithm moves 
one of its servers from one cluster to another, and a new phase begins from the next request. 

By definition of a phase, the number of the algorithm's servers in each of the I clusters does 
not change during a phase. We say that a phase is of type (A?i, . . . , ki) if the algorithm has ki 
servers inside the cluster Mj, during that phase. Thus, the zeroth phase is of type (k, 0, . . . , 0). 
Suppose the requests are given to the algorithm ad infinitum. We claim that if the algorithm 
has a finite competitive ratio, there must be infinitely many phases of each type. Suppose this 
is not the case. Then there exists a phase type k = (k\, . . . ,ki) such that there are only finitely 
many phases of type k and each of these phases is finite. The adversary serves the requests 
arbitrarily until the last phase of type k is over. Once the last phase of type k is over, the 
adversary places ki servers inside Mj at (i, 1), . . . , (i, ki) for each i permanently. Henceforward, 
the adversary always has more servers than the online algorithm, in some cluster. It is then able 
to serve all the future requests at zero cost, while the algorithm pays for each request. Thus 
the algorithm cannot have a finite competitive ratio. In particular, this claim implies that the 
algorithm must change phases infinitely often. 

The adversary runs many algorithms in parallel with the online algorithm. We will denote 
this number by h(k, I) and calculate it later. For each phase type k = {k\, . . . , k{), the adversary 
runs g{n) = Jl!=i(^ + 1) — 1 algorithms, each of which has ki servers in Mj, for each i, perma- 
nently. These algorithms maintain the following invariant. During a phase of type k' ^ n, all 
these algorithms place their ki servers on the points (i, 1), . . . , (i, ki) G Mj, for each i. As before, 
these algorithms have more servers than the online algorithm, in some cluster. Thus they serve 
the requests at zero cost. 

On the other hand, during a phase of type k, consider the sets of points M[ = {(i,j) \ j < 
ki + 1}, for i = 1 to There are n!=i(^ + 1) wa ys of placing k\ servers at distinct points in 
M[, &2 servers at distinct points in M' 2 , . . ., ki servers at distinct points in M[. Since the request 
sets in the phase are subsets of Uj=i ^[i * ne se * °f points S occupied by the online algorithm's 
servers corresponds to one of the n!=i(^« + 1) wa Y s - The adversary maintains the invariant, 

4 Thus is for convenience only. 

5 This is a standard argument; one can postpone the movements of a server to the instant when it gets a chance 
to serve a request, and this does not increase the distance traveled by the server. 
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that the configurations of its n!=i(^« + 1) — 1 algorithms of type k correspond to each of the 
other ways. Observe that the next request is ^lj!=i M^j \ S and thus, each of the adversary's 
type k algorithms serves the request at cost. However, whenever the online algorithm moves 
and incurs a cost 1, its configuration becomes equal to that of exactly one of the adversary's 
type k algorithms. In order to maintain the invariant, this type k algorithm executes a reverse 
move to attain the configuration S, and incurs a cost 1. 

Thus, for the requests which do not result in the beginning of a new phase, the cost incurred 
by the online algorithm is equal to the total cost incurred by all the adversary's algorithms. 
Now consider a request which results in a phase change from k to k! . The online algorithm 
incurs a cost D. Further, each of the g{n) + g(n') algorithms of the adversary, corresponding 
to the phase types k and k', has to shift at most 1 server within every cluster to maintain the 
invariant. Note that none of the adversary's algorithms ever shifts its servers across clusters. 
Thus the total cost incurred by the adversary's algorithms is I • (g(n) + g(n')) < I • h(k, I). We 
choose D > I ■ h(k,l), so that whenever a new phase begins, the adversary pays less than the 
online algorithm. 

Suppose the adversary stops giving requests after r phases, and this results in m requests. 
The cost incurred by the online algorithm is rD for the requests resulting in a phase change, and 
m — r for the requests not resulting in a phase change. Hence the cost incurred by the online 
algorithm over the entire request sequence is rD + m — r. 

Let us estimate the total cost of the adversary's h(k, I) algorithms. Each of the adversary's 
algorithms incurs a cost of at most kD to establish the invariant initially. Later, for the m — r 
requests not resulting in a phase change, the total cost of these algorithms is m — r and for the 
r requests resulting in a phase change, the total cost of these algorithms is at most rl ■ h(k,l). 
Thus the overall cost of all the adversary's algorithms, over all requests, is at most kD ■ h(k, I) + 
rl • h(k, I) + m — r. 

We choose r > kD ■ h(k, I)/ (D — I ■ h(k, I)) so that rD > kD- h(k, l) + rl- h(k, I) and thus the 
cost of the online algorithm is no less than the total cost of the adversary's algorithms. Further, 
since the adversary runs h(k,l) algorithms, one of these must incur a cost less than l/h(k,l) 
times the cost of the online algorithm. This proves a lower bound of h(k, I) on the competitive 
ratio of any online algorithm for (k, Z)-MSSMS. 

Finally, let us calculate h(k,l), the total number of algorithms run by the adversary. This 
number is given by 



E 

(fcl,...,fci):fclH \-ki=k 



lit** + x ) 



.i=i 



(k 1 ,...,k l ):k 1 +---+ki=ki=l 



E 



(fcl,...,fej):fclH \-k t =k 



Observe that the first term on the right hand side is the number of ways of placing 21 — 1 identical 
red balls and k identical green balls on a line, where fcj is the number of green balls between the 
(2i - 2) nd and the 2i th red balls from the left. Thus, we have 



E lite + 1) 

(k 1 ,...,k l ):k 1 +---+ki=ki=l 



k + 2l 

k 



1 
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The term ^(fa ki)-ki-\ vh=k ^ can ^ e easu y seen to be equal to [ k+l k x ). Thus the number of 

algorithms run by the adversary is 

□ 

5 The Offline problem 

We elaborate on the offline (k, /)-MSSMS problem in this section. Before that, we briefly describe 
the offline algorithms for the fc-server problem, and Metrical Service Systems. 

The problem of finding the optimal solution to an instance of /c-server problem can be 
reduced to the problem of finding a min-cost flow on a suitably constructed directed graph [8J . 
One way to construct the required directed graph is as follows. Let (M, d) be the metric space, 
let r_f c+ i, . . . ,ro G M be the initial positions of the k servers, and let p = (r%, ... ,r m ) be the 
requests, where S M. The directed graph has vertex set {s} U {ti-fc+i, . . . ,uo,u\, . . . , u m } U 
{vi, . . . , v m } U {t}. The edges are s — > Ui with cost for all —k + 1 < i < m; Ui — > Vj with cost 
d( r i, r j) for all —k + 1 < i < m, 1 < j < m, i < j; and Vj —> t with cost for all 1 < j < m. All 
these edges have capacity 1. The goal is to find a min-cost s-t flow of value m. Thus, the offline 
fc-server problem can be solved in polynomial time. Note that in the above instance of min-cost 
flow, even if we allow a fractional flow through the edges, there exists an optimal solution in 
which all the flows are integral. We will use this observation later. 

Further, the offline MSS problem can be translated to finding the shortest source-to-sink 
path in a suitably constructed directed graph of size linear in the size of the instance. Thus, 
MSS too can be solved in polynomial time. In fact, (k, /)-MSSMS can be solved in polynomial 
time for any constant k. However, the problem is NP-hard for any fixed / > 2, when k is allowed 
to vary. 

In the subsequent subsections we give a natural ILP for the offline (k, /)-MSSMS problem. We 
show that it has an infinite integrality gap. We however use it to design a pseudo- approximation 
algorithm which uses kl servers and incurs a cost of at most / times the optimum. We then 
prove a lower bound which seems to suggest that nothing better than our pseudo-approximation 
can be achieved in polynomial time. 

5.1 An Integer Linear Program Formulation 

For the metric space (M,d) let S = {s 1 , . . . , s k } be the set of initial positions of the k servers 
and let the request sequence be p = (R\, . . . , R m ) where Ri C M and Ri = {rj, . . . , r'}. A 
natural integer linear program is as follows. For each 1 < i < i' < m and 1 < < / we have a 
variable f(i,j, i' which is 1 if some server was present at r\ to serve Ri, and that server was 
next made to shift to rj, in order to serve R^] and otherwise. For each 1 < k' < I, 1 < i < m, 
1 < j < I we have a variable g(k',i,j) which is 1 if the A/'th server was first shifted to rj, to 
serve Ri, and otherwise. The constraints are as follows. 
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^2g{k',i,j) < 1 for each 1 < k' < k 

Y,9( k ',i,j) + /(*'./. ^ 2 f(hj,i",f) for each 1 < i < m, 1 < j < i 

fc' «'<i,j' i">i,j" 

^2d{k',i,j) + ^ f{i',j',hj) > 1 for each 1 < i < m 
fc'J i'<i,j',j 
f(i,j,i',f) e {0,1} for all g(k',i,j) € {0,1} for all A/,i, j 

The objective is to minimize the following function. 

Y,d{s k \rl)g{k'^j)+ £ d{ri,4)f{i,j,i',j') 

k',i,j i,j,i'J' 

To relax the above ILP to an LP, we replace the constraints f(i, j, £ {0, 1} for all i, j, 
and g(k',i,j) € {0,1} for all k',i,j by the constraints < f{i,j,i',j') < 1 for all i,j,i',f and 
< g(k',i,j) < 1 for all k',i,j respectively. Unfortunately, this LP relaxation has unbounded 
integrality gap. 

To see the unbounded integrality gap, consider the uniform metric space on the kl + k points 
{— k + 1, . . . , 0, 1, . . . , kl} and assume that the servers are located at the points — k + 1, . . . , 
initially. Let Ri, . . . , Rrki\ be an arbitrary ordering of the subsets of {1, ... , kl} of size I. The 
request sequence is constructed by repeating this sequence m times. This request sequence can 
be fractionally served as follows. Initially a l/l fraction of servers shifts to each of the kl points 
l,...,kl and this costs k. Having done this, all requests can now be served at zero cost. Thus 
the optimum of the LP relaxation is at most k. 

However, an integer solution must have cost at least (k— 1)(Z— 1) on the sequence R\,. . . , R/ki\ , 
starting with any initial server configuration. This is justified as follows. Let Sq be the initial 
server configuration, and Si be the configuration after Ri is served. Let U = So U ■ ■ ■ U S^m-j . 
Then U is a hitting set for the sequence R±, . . . ,R/m\. But any hitting set of this sequence 
must have size at least kl — I + 1, since the sequence contains all /-subsets of {1, ... , kl}. Thus 
\U\ > kl — I + 1. Next, U = S U (Si \ S ) U (S 2 \ Si) U • • ■ U (Snas \ 5 (>=/)_i) and therefore 

|So| + | Si \ So| + |S 2 \ Si| + • ■ ■ + S^w-j \ S/thj > kl — I + 1. This implies that the cost of 
serving the R±, . . . , Rsu\ is at least kl — I — 1 — |So| = kl — I + 1 — k = (k — 1)(Z — 1). 

Therefore the total cost of the ILP is at least m(k — 1)(7 — 1). Since m can be chosen to be 
arbitrarily large, the ILP has infinite integrality gap when both k and I are more than 1. 

5.2 A Pseudo-approximation algorithm 

The ILP mentioned in the previous subsection can be used to get a pseudo-approximation 
algorithm for MSSMS. This algorithm computes a feasible solution to a given instance of (k,l)- 
MSSMS using kl servers instead of k, and whose cost is at most I times the cost of the fractional 
optimal solution (and hence the cost of the integer optimal solution with k servers). 

Given an instance of (k, Z)-MSSMS, let the vector (f*,g*) be I times the optimum (fractional) 
solution of the LP relaxation. Then for each i, we have Y^k' j 9*(^' '> ^i)+Si'<? j' j /*(*' 'if ' 1 h j) — 
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I. Therefore, for each i, there exists some ji such that ^ fc , g*(k' ,i, ji) + Si'<j y /*(*' > / ih 3i) — 1- 
Let r* = r;? 1 . We treat r*, . . . , as an instance of the /d-server problem, compute the optimum 
solution and return it. Note that this is a feasible solution. To analyze its cost, observe that the 
vector (f*,g*) gives a (fractional) solution to the instance r[, . . . , of the kl-seivei problem. 
Hence the cost of the optimum solution to this instance is no more then the cost of (f*,g*), 
which is in turn, I times the cost of the optimum fractional solution to the LP relaxation of the 
original instance of (k, /)-MSSMS. Thus, the solution returned by our algorithm uses kl servers, 
and has cost at most I times that of the optimum. 

5.3 Hardness of Pseudo- Approximation 

In this section, we prove the following hardness result, which essentially implies that nothing 
better than what the pseudo-approximation algorithm does, can be achieved in polynomial time. 

Theorem 7. Assuming the Unique Games Conjecture [19j, it is NP-hard to pseudo-approximate 
the (k, l)-MSSMS problem, for any fixed I > 2, on the uniform metric space with n points, within 
any factor polynomial in n, using less than kl servers. 

We will use the following hardness result due to Khot and Regev [[201 . 

Theorem 8. Assuming the Unique Games Conjecture, it is NP-hard to approximate the size of 
the vertex cover of I -uniform hypergraphs within a factor less than I. 

Proof of Theorem [?| Suppose there is a polynomial time algorithm A which, on a metric space 
with n points, can pseudo-approximate the solution of an instance of (k, Z)-MSSMS, using K < kl 
servers, within a factor f(n) = 0(n p ) for some constant p independent of k. We use this 
algorithm to construct a polynomial time algorithm B which, when given a /-uniform hypergraph 
having a vertex cover of size k, outputs a vertex cover of the hypergraph of size K. This 
contradicts Theorem [HI unless P=NP. 

Algorithm B does the following. Suppose it is given a hypergraph H = (V, E) where \ V\ = n, 
E = {Ei, . . . , E m }, Ei C V \Ei\ = I, and an integer k, with the promise that H has a vertex 
cover of size k. B constructs the uniform metric space on the set VtfclW where W = {wi, . . . , Wk}- 
Then it constructs an instance of (k, Z)-MSSMS, where one server is initially placed at each of the 
Wi's and the request sequence is Ei, . . . ,E m repeated more than kf(n + k) (but polynomially 
many) times. We will call each repetition of the sequence Ex,-- - ,E m a phase. B then uses 
algorithm A to find a pseudo-approximate solution of this instance, using K < kl servers. 

Since the hypergraph has a vertex cover V' C V of size k, the requests can be served by 
shifting the servers to the points in V 1 once for all, at a cost k. The cost of the solution returned 
by algorithm A is therefore at most kf(n + k), which is less than the number of phases. Thus, 
there exists a phase in which A incurs zero cost, which means that A does not shift server during 
this phase at all. Since all the hyperedges are requested in each phase, the set of points occupied 
by the K servers in this phase must be a vertex cover of H, with size at most K. 

Having obtained a solution from algorithm A, B simply searches for a phase in which A 
incurred zero cost, and returns the set of K points occupied by the servers during this phase, 
as an approximate vertex cover of H. It is easy to see that since A runs in polynomial time, so 
does B. □ 
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Theorem [7] can be made independent of the validity of the Unique Games Conjecture, as 
follows. 

Theorem 9. It is NP-hard to pseudo-approximate the (k,l)-MSSMS problem, on the uniform 
metric space with n points, within any factor polynomial in n, and using less than k(l — l) servers 
for I > 2, and less than 1.36/c servers for I = 2. 

This can be proved using the following two results (the former due to Dinur et. al. |12j and 
the latter due to Dinur and Safra [13]) in place of Theorem [8] for the cases I > 2 and I = 2 
respectively. 

Theorem 10. It is NP-hard to approximate the size of the vertex cover of l-uniform hypergraphs 
within a factor less than I — 1. 

Theorem 11. It is NP-hard to approximate the size of the vertex cover of (usual) graphs within 
a factor less than 10\/5 — 21 ~ 1.36. 

Remark. The above theorems imply, that unless P=NP, there cannot exist a competitive online 
algorithm for MSSMS, which does a polynomial amount of computation per request. This rules 
out the competitiveness of several natural algorithms one can think of. 

6 Approaches that fail 

In this section we discuss a few popular techniques, that produced good algorithms for the k- 
server problem and metrical service systems, but which, in case of MSSMS on arbitrary metric 
spaces, either do not give competitive online algorithms, or their analysis involves more technical 
complications. 

6.1 The Harmonic Algorithm 

The Harmonic algorithm is a natural memoryless algorithm and was among the first studied 
randomized algorithms for the fc-server problem. In order to serve a request at a point y when 
the servers are at x%, . . . , X}~, the algorithm moves the server located at Xi to y with a probability 
inversely proportional to d(xi,y). This algorithm has a competitive ratio of Oik ■ 2 k ) \17\ H]. 
The natural analog of this algorithm for the MSSMS problem would be the algorithm which, 
when given a request {yi, . . . ,yi} with the servers at x±,. . . ,Xk, moves the server at xi to yj 
with probability (d(xi, yj))' 1 / T,i,j( d ( x ii Vj))~ X - 

The Harmonic algorithm, however, does not work even for metrical service system on the 
line. To see this, consider the metric space of integer points on the line. Let k = 1, 1 = 2 and 
let the server be initially located at 0. For i = 1 to m, let the i'th request in the sequence be 
{— and let the m + l'st request be { — 1}. The optimal solution is to move the server to 
— 1, and this costs 1. However, it is easy to verify that with probability l/(i + 1) the server 
is at i after the ith request. This implies that the interval [i — is traversed twice with a 
probability l/(i + 1), and not traversed at all with a probability i/{i + 1). The interval [—1,0] 
is traversed once, with probability 1. The expected cost incurred by the algorithm is thus 
l + 2Ef=i l/(i + l) = 2fl ro -l. 
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6.2 The Work Function Algorithm 

The technique of work functions has given the best known deterministic online algorithms, both 
for the &;-server problem as well as MSS. It seems to be the most promising algorithm for the 
MSSMS problem too. The work function algorithm can be described as follows. 

Let (M, d) be a metric space. We can think of this metric space as a complete edge-weighted 
graph on the vertex set M, where the weight on the edge (x, y) is d(x, y). A server configuration is 
a multiset of server positions on M. Thus, in case of the /c-server problem, a server configuration 
is a multiset of k points from M, whereas, in case of the MSS problem, a server configuration 
is a single point. Given server configurations X and Y , let d{X, Y) denote the minimum cost 
that needs to be paid, in order to change configuration X to Y. In other words, d(X, Y) is the 
weight of the minimum weight matching between the sets X and Y. It is easy to see that with 
this definition, d is a metric on the set of server configurations. 

Let Xq denote the initial server configuration and p = R2, ■ ■ .) be the sequence of requests 
in an instance of the problem under consideration (/c-server, MSS or MSSMS). Let pi denote the 
prefix of p with the first i requests. For any sequence p' of requests, the work function uy is a 
function on the set of server configurations. The value of this function at a server configuration 
X is defined to be the minimum cost of starting from Xq, serving the request sequence p' , and 
ending up with the configuration X. Thus, if w is the work function after the entire sequence 
of requests, then the minimum value of w(X), over all configurations X, is the optimum cost of 
serving the requests. 

The work function algorithm (WFA) works as follows. Let u?j = w Pi . Suppose that the 
online algorithm has the server configuration after serving the first i requests. In order to 

serve the request Ri, the algorithm changes the server configuration to X{, which serves Ri, and 
which is such that Wi(Xi) + d(Xi—i, Xi) < Wi(Y) + tf(Xj_i, Y) for every server configuration Y. 
(It can be shown that such an Xi must exist.) This algorithm achieves a competitive ratio of 
2k — 1 for the /c-server problem |22j . while a variant of this algorithm is Oil • 2')-competitive 
for MSS with I points per request [7]. It is therefore, natural to expect that WFA for MSSMS 
will achieve some competitive ratio, which is a function of k and I only. Below, we explain the 
roadblocks encountered in our attempts to prove this claim along the lines of [7j and |22j. 

It can be easily shown that for any work function w, and configurations X and Y , we 
have w(X) < w(Y) + d(X,Y). A configuration Y is said to dominate a configuration X in w if 
w{X) = w(Y)+d(X, Y). The support of a work function is defined to be the set of configurations 
X such that X is not dominated by any other configuration Y, in w. It is also easily seen that 
each such X must serve the last request in the sequence which results in the work function w. 
In case of MSS with I points per request, the size of the support of any work function is at most 
I, and in fact, the support is a subset of the last request. The proof of the competitive ratio of 
WFA, by Burley [7J, uses this fact crucially. This property, however, does not remain true for 
MSSMS. The counterexample is as follows. Let k = I = 2, the metric space be uniform, and 
the initial configuration be {xq,x' }. For i = 1 to m, let the i'th request be {xi,x'j}, where the 
points xq, x , xi, x'i, ■ ■ ■ , x m , x' m are all distinct. It is easy to verify that the support of the work 
function after this sequence contains all 2-element subsets of {xq, x' , x\,x\, . . . ,x m ,x' m }, which 
contain either x m or x' m . Thus, the size of support of a work function for MSSMS could depend 
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on the number of points in the metric space, even for fixed k and I. It is, therefore, unlikely 
that the ideas in Burley's proof can be easily extended for MSSMS. 

In the case of fc-server problem, the proof of the competitive ratio of WFA, by Koutsoupias 
and Papadimitriou [22], depends on a property of work functions, which they call quasi- convexity. 
The quasi-convexity property states that for any work function w p arising from an instance p of 
the fc-server problem, and any two configurations X and Y, there exists a bijection g : X — > Y, 
such that for any X' C X, w(g(X') U (X \ X 1 )) + w(X' U g(X \ X')) < w(X) + w(Y). This 
property is also not true for work functions in case of MSSMS. As a counterexample, consider 
the uniform metric space. Let the initial configuration be {1,2} and let the request sequence 
be {3, 4}, {5, 6}, {3, 6}, {5, 4}. If w is the work function after this sequence, then itf({3, 5}) = 
w({4,6}) = 2 and the value of w at any other configuration is more than 2. Thus w is not 
quasi-convex. It would, therefore, be interesting to investigate whether there exists an analog 
of quasi-convexity for I > 1. 

7 Open Problems 

We mention a number of interesting open problems. We begin with the most important problem 
left open. 

Problem 1. Design an f{k, Z)-eompetitive deterministic / randomized algorithm for (k,l)- 
MSSMS on arbitrary metric spaces, for some function /. 

For (k, Z)-MSSMS on arbitrary metric spaces, we do not have an online algorithm with 
competitive ratio determined by k and I alone. We believe that such an algorithm exists, and 
the Work Function Algorithm is the prime candidate for the deterministic case. We know 
that if such a deterministic algorithm exists, it must perform a super-polynomial amount of 
computation on the input. It would be interesting even to find a constant factor competitive 
algorithm for (2, 2)-MSSMS on special metric spaces, such as the line. 

The lower bound, that we obtain in Section [H on the competitive ratio of such a deterministic 
algorithm is polynomial in I for a fixed k, and vice versa. We believe that a better lower bound 
exists, which is exponential in I for a fixed k, and at least polynomial in k for a fixed I. Such a 
lower bound can possibly proved by a construction which combines our ideas and the ideas in 
the Q(2 l ) lower bound proof for MSS by Fiat et. al. [15j . In the randomized case, we believe 
that a lower bound which is polynomial in I and logarithmic in k should hold. 

Problem 2. Prove that the competitive ratio of any deterministic (resp. randomized) algorithm 
for (k, Z)-MSSMS on arbitrary metric spaces must be £l(k)™ 1 ' (resp fi(poly(Z) • log A;)). 

For deterministic algorithms on uniform metric spaces our upper and lower bounds on the 
competitive ratio differ by the factor k. We believe that the upper bound can be improved by 
carefully choosing the hitting sets, in the Hitting Set algorithm, described in Section[2j Similarly, 
there is a significant gap between the upper and lower bounds in case of randomized algorithms. 
In all the upper bound (resp. lower bound) results we have made the conservative assumption 
that whenever the algorithm (resp. adversary) faults, it can potentially shift all the k servers 
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and hence the cost incurred is k, while the adversary (resp. algorithm) shifts at most one server 
per fault. This assumption introduces a gap of a factor of k between the bounds. 

Problem 3. For (k, ^)-MSSMS on uniform metric spaces, close the gap between the lower bound 
of ( k f) - 1 and the upper bound of k- (^( k ^ 1 ) — 1^ in the deterministic case, and the lower bound 
of f2(log kl) and the upper bound of 0(/c 3 log /) in the randomized case. 
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